Logistic Regression in Datastreams

Authors

  • Chris Schwiegelshohn
  • Christian Sohler
  • Katharina Morik
Abstract

Learning from data streams is a well-researched task both in theory and practice. As remarked by Clarkson, Hazan and Woodruff [12], many classification problems cannot be very well solved in a streaming setting. For previous model assumptions, there exist simple, yet highly artificial lower bounds prohibiting space-efficient one-pass algorithms. At the same time, several classification algorithms are often successfully used in practice. To overcome this gap, we give a model relaxing the constraints that previously made classification impossible from a theoretical point of view and under these model assumptions provide the first (1 + ε)-approximate algorithms for sketching the objective values of logistic regression and perceptron classifiers in data streams.


Similar resources

A NEW APPROACH FOR PARAMETER ESTIMATION IN FUZZY LOGISTIC REGRESSION

Logistic regression analysis is used to model a categorical dependent variable. It is commonly used in the social sciences and clinical research. Human thoughts and disease diagnosis in clinical research contain vagueness. This situation leads researchers to combine fuzzy set and statistical theories. Fuzzy logistic regression analysis is one of the outcomes of this combination and it is used in situa...


Comparison of ordinary logistic regression and robust logistic regression models in modeling of pre-diabetes risk factors

Background: Given the increased risk of developing type 2 diabetes in pre-diabetic people, identifying pre-diabetes and determining its risk factors is necessary. This study aims to compare ordinary logistic regression and robust logistic regression models in modeling pre-diabetes risk factors. Methods: This is a cross-sectional study conducted on 6460 people, over ...


File-Based Storage of Digital Objects and Constituent Datastreams: XMLtapes and Internet Archive ARC Files

This paper introduces the write-once/read-many XMLtape/ARC storage approach for Digital Objects and their constituent datastreams. The approach combines two interconnected file-based storage mechanisms that are made accessible in a protocol-based manner. First, XML-based representations of multiple Digital Objects are concatenated into a single file named an XMLtape. An XMLtape is a valid XML fi...


Comparison of artificial neural network with logistic regression in prediction of tendency to surgical intervention in nurses

Introduction: Logistic regression is one of the modeling methods for bipartite dependent variables. An artificial neural network, on the other hand, is a flexible method with the fewest restrictions. The growing number of unnecessary beauty surgeries and the importance of prediction and classification led us to undertake the present study, with the aim of comparing logistic regression and artificial ...


Factors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis

Background: Due to the importance of medical studies, researchers in this field should be familiar with various types of statistical analyses so they can select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...



Publication date: 2014